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Abstract. A simple model to handle the flow of people in emergency evacuation situations 
is considered: at every point x, the velocity U(a;) that individuals at x would like to realize is 
given. Yet, the incompressibility constraint prevents this velocity field to be realized and the 
actual velocity is the projection of the desired one onto the set of admissible velocities. Instead 
of looking at a microscopic setting (where individuals are represented by rigid discs), here the 
macroscopic approach is investigated, where the unknown is a density p(t,x). If a gradient 
structure is given, say U = — VD where D is, for instance, the distance to the exit door, the 
problem is presented as a Gradient Flow in the Wasserstein space of probability measures. The 
functional which gives the Gradient Flow is neither finitely valued (since it takes into account 
the constraints on the density), nor geodesically convex, which requires for an ad-hoc study of 
the convergence of a discrete scheme. 

Crowd motion; Gradient Flow; Wasserstein distance; Continuity equation. 

1. Introduction 

In the last two decades, several strategies have been proposed to model the motion of pedes- 
trians. Most of them rely on a microscopic approach: the degrees of freedom are the positions 
of individuals, and their evolution depends on a balance between selfish behaviour, congestion 
constraints, and possibily social factors (politeness, gregariousness) . Among those microscopic 
models, some are based on a stochastic description of the individual behaviour (see e.g. [27j). 
whereas others are purely deterministic (see |24 | I28 | [2"U]). 

An essential ingredient in those models lies in the way interactions between individuals are 
handled, in particular in the case of high density (congestion phenomena). Following the clas- 
sification which holds in the modelling of granular flows, one can differentiate the Molecular 
Dynamics (MD) approach (the non overlapping constraint between rigid grains is relaxed, and 
handled by a short range repulsive force) and the Contact Dynamics (CD) one (the collisions 
are explicitely taken into account). In the context of pedestrians, MD strategy has proved to be 
quite efficient to model congestion. In particular Helbing[22~l [24"] [26] introduced the concept of 
social forces, which are designed in such a way that individuals tend to repel each other when 
their distance drops below a certain value. The model proposed in |34] relies on the alternative 
strategy: individuals do not interact with each other as soon as they are not in contact, and the 
non overlapping constraint is treated in a strong (non relaxed) way. Although it is natural to 
expect some link between the two approaches (MD models are likely to converge in some way 
to their CD counterparts as the repulsive force stiffness goes to infinity), it is to be noticed that 
the mathematical structures of the two classes of models are quite different. In the first case, 
Cauchy-Lipschitz theory for ODE's applies, whereas CD models present some analogies with the 
so-called sweeping process introduced by Moreau[36] in the 70', for which a dedicated framework 
has been developped (see |20] . |21] . [35]). 

In the case of macroscopic models, the first strategy (congestion is treated in a relaxed way) 
is favoured, as it allows to use classical methods for studying PDE. For example, crowd motion 
models inspired from traffic flow models have been developped (see Refs. \12\ I13|). They 
take the form of hyperbolic conservation laws, and they are essentially monodimensional in 
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space. In higher dimension, Bellomo and Dogbe[4j [19] proposed second order models, where a 
phenomenological relation describes how the crowd modifies its own speed: 

/ dtp + V x • (pv) = 

\ a 4 v + (v-V x )v = F(p,v). 

Typically, the motion is governed by F, which has two parts: a relaxation term toward a definite 
speed, and a repulsive term to take into account that pedestrians tend to avoid high density 
areas. Degond[T8] uses the same approach to model sheep herds. In this model, the term F 
depends on a pressure which blows up when the density approaches a given congestion density 
(barrier method). There also exist first order models, where the velocity field is directly defined 
as a function of the density (see e.g. |30| \31\ 115]). Another class of models is described by 
Piccoli and Tosin in Refs. [38], [39]. They propose a time-evolving measures framework, where 
the velocity of the pedestrian is composed by two terms: a desired velocity and an interaction 
velocity. The last one models the reaction of the pedestrian to the other surrounding pedestrians 
(namely, people can deviate from their preferred path if they enter a crowded area). 

To our knowledge, as the ones presented above, all macroscopic models rely on a relaxed ex- 
pression of the congestion. Let us mention however the work of Buttazzo, Jimenez and Oudet in 
[9], where the optimal transportation between two given densities is computed under constraints 
(obstacles, congestion, ...) which can be strongly expressed. Yet, this approach is very different 
from the model we describe later, since its goal is to find an optimal transport between den- 
sities as in the work of Benamou and Brenier[B] (which is the classical reference for dynamical 
formulations of transport problems), whereas optimal transportation is in our case a very suit- 
able tool. Moreover, we will mainly make use of the distance that optimal transport induces on 
probability measures rather than looking at the optimal maps themselves, as we will see after a 
brief description of the model we consider. 

The macroscopic model we present here is based on a strong expression of the congestion 
constraint. It is a natural extension of the microscopic approach proposed in Refs. |34 [ [35l Rd] . 
which we describe here in its simpler form. The crowd configuration is represented by the 
position vector q = (qi, . . . , qjv). Each of the N individuals whishes to have a velocity Uj 
which depends on its position only: Uj = U(qj), where U(-) is some given velocity field over 
]R 2 (typically U = -VD, where D is the geodesic distance to the exit). To account for non- 
overlapping, it is assumed that the actual velocity u = (ui,...,U/v) is the £ 2 -projection of 
U = (Ui, . . . , Uat) = (U(qi), . . . , U(q/v)) onto the cone of feasible velocities C q (i.e. the set of 
velocities which do not lead to a violation of the non-overlapping constraint). The model takes 
the form 

( dq 

~r = u 

(1) ! dt 

. u = P Cq U. 

In the spirit of this microscopic approach, the model we propose here rests on the two following 
principles 

(1) the pedestrian population is described by a density p which is subject to remain below 
a certain maximal value (equal to 1 in what follows), this density follows an advection 
equation, 

(2) the advecting field u is the closest, among admissible fields (i.e. which do not lead to 
a violation of the constraint), to some spontaneous field U, which corresponds to the 
strategy people would follow in the absence of others. 

If we denote by C p the cone of admissible velocities (i.e. set of velocities which do not increase 
density in already saturated zones, see next section for a proper definition), the model takes the 
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following form 



(2) 



dtp + V-pu = 

u = P Cp U, 



where the projection is meant in the 1? sense. As a matter of fact, in the same way as Cauchy- 
Lipschitz theory for ODE's no longer applies for CD in the microscopic case, we cannot use 
classical methods to study Equation (|2|), as well as most of the PDE's we could encounter in 
the CD macroscopic models. This is due in particular to the lack of regularity of the velocity u 
(whose natural regularity is L 2 ), which prevents us to apply the characteristic method or even 
DiPerna-Lions theory. The non-continuous dependence of the operator Pq with respect to p is 
another source of problems. 

Instead, we will see that this PDE corresponds to a Gradient Flow in the Wasserstein space 
(i.e. the space of probability measures endowed with the distance W2 induced by the optimal 
transport under quadratric cost), provided that the spontaneous velocity field has a gradient 
structure: U = — VD. This means that we consider the functional 



and we look for the curve of measures p(t, .) which follows the steepest descent direction of $ 
starting from a given datum p°. This curve will happen to solve equation ([2]). This is a general 
and very efficient method to find solutions to certain evolution PDE's which been made possible 
by the theory of optimal transportation. This theory owes its origin to Kantorovich|33|. but has 
been widely developped thereafter (see the books by ViHani[i2|l43|). Several equations have been 
approached by this method, for instance the classical heat equation, as well as the Fokker-Planck 
or the porous media equations (see Refs. |32|. IBT) 111]). Notice that, as the functional which is 
used to produce the porous media equation as a Gradient Flow is 



our case can be considered as its formal limit when m tends to infinity. All the theory of Gradient 
Flow in Wasserstein Spaces is treated in the reference book by Ambrosio-Gigli-Savare[2] and one 
of the key assumptions is the A-convexity of the functional, which ensures better estimates. On 
the other hand, some existence results can be obtained without this assumption, but they have 
to be treated carefully by hand, as it happens in Ref. [7j. In our case, even if we suppose D to 
be A-convex, we face the same kind of difficulties if we want to add the presence of an exit door 
on the boundary of f2 where the measure can concentrate (see Section [2]). 

The paper is organized as follows: In Section [2] we present the model in the Eulerian setting 
and a related discrete minimizing movement scheme (MMS). We explain how a straightforward 
use of a convergence theorem in [2] asserts a convergence of the trajectories for the discrete MMS 
to some continuous pathline. Identification of this limit with a solution to the initial problem can 
be done unformally. Yet some technical obstacles (in particular the handling of walls) prevent us 
from obtaining a fully rigorous proof based on this approach. The actual proof of convergence to 
a solution of the crowd motion model is based on alternative arguments. The end of this section 
describes this convergence results. As the presence of an exit raises some very specific technical 
difficulties, we propose in Section [3] a proof in the case there is no exit. The proof in the general 
case in given in Section |H To illustrate the convergence theorem, we present in Section 5 an 
idealized (yet non trivial) situation where both eulerian solutions and discrete MMS trajectories 
can be described with accuracy. Finally, we discuss in Section 6 the limitations of this model 
and its possible extensions to other fields of natural sciences. In particular, we explain why we 
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developped the theory in any dimension although dimensions greater than two do not make clear 
sense as far as crowd motion is concerned. 

2. The eulerian model and its gradient flow formulation 

2.1. Eulerian model. The model we propose is designed to handle emergency evacuation situ- 
ations : the behaviour of individuals is based on optimizing their very own trajectory, regardless 
of others, but the fulfillment of individual strategies is made impossible because of congestion. 

The model takes the following form: given a domain f2 (the building), whose boundary T is 
composed of T ou t (the exit) and T,,, (the walls), we describe the current distribution of people by 
a measure p of given mass (say 1 without loss of generality) supported within Q. To model the 
fact that people getting through the door are out of danger, yet keeping a constant total mass 
without having to model the exterior of the building, we shall assume that p may concentrate 
on T ou t- In this spirit, we denote by K the set of all those probablity measures over M? that are 
supported in 0, and that are the sum of a diffuse part, with density between and 1, in f2, and 
a singular part carried by T out . 




Figure 1. Geometry. 

We shall denote by U the spontaneous velocity field: U(x) represents the velocity that an 
individual at x would have if he were alone. It is taken equal to outside Q. The set C p of 
feasible velocities corresponds to all those fields which do not increase p on the saturated zone 
(unformally, V ■ u > in [p = 1]), and which account for walls (people do not walk through 
them). As we plan to define C p as a closed convex set in L 2 (f}), those constraints do not make 
sense as they are, and we shall favor a dual definition of this set. Let us introduce the "pressure" 
space 

Hp = {?€ H 1 (0) , q > a.e. in n , q(x) = a.e. on [p < 1] , q ]Tout = 0}. 
The proper definition of C p reads 

(3) c7 p = {veL 2 (ft) 2 , fvV?<0 VgetfH. 

Jo, 

The model is based on the assumption that the actual instantaneous velocity field is the feasible 
field which is the closest to U in the least-square sense, i.e. it is defined as the L 2 -projection of U 
onto the closed convex cone C p . Finally the problem consists in finding a trajectory 1 1— > p(t) € K 
which is advected by u, i.e. such that (p, u) is a (weak) solution of the transport equation in R 2 

(4) d t p + V • (pu) = 0, 
where u verifies, for almost every t, 

(5) u = P Cp U. 
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Remark 2.1. The fact that Y ou t is likely to carry some mass calls for some proper definition of 
the velocity on this zero-measure set. As the exit plays the role of a reservoir in our model, we 
shall actually consider that all feasible fields vanish on T ou t, so that velocity u given by © will 
be considered as defined Lebesgue-a.e. in Q and vanishing on T ou t- 

Remark 2.2. Boundary conditions (walls and exit). 

The unilateral divergence constraint and the behaviour at walls and exit are implicitly contained 
in the dual expression of C p , as illustrated by the following considerations. We assume in this 
remark that [p = 1] = uJ where w C is a smooth subdomain, and that all fields are smooth. 
First of all, by taking tests pressures which are smooth and compactly supported in w, we obtain 
V • u > in the saturated zone. As the pressure vanishes on T ou t, the velocity is free on that 
part of the boundary (free outlet condition, as in Darcy flows). Consider now a situation where 
the saturated zone covers the wall r„,. For any smooth function ip defined on consider a, 
sequence of extensions ip e supported within u) U T W1 which converges to in L 2 (tt). Then 

u • Vv% < Ve > 

n 

implies 

- / tp £ V • u + / ip £ u • n < Ve > 0. 

As the first term goes to with e we obtain that the velocity necessarily enters the domain on 
the saturated wall (what we adressed before as "people do not walk through walls"). 

2.2. Gradient flow formulation. In this section we introduce a discrete evolution problem 
in the Wasserstein space, whose limit will be the gradient flow of a suitable functional, and we 
establish unformally the link between this new problem and the crowd motion model. The formal 
equivalence, which will be proved rigorously in the following sections, will be satisfied in the case 
where U = —VD is the opposite of a gradient. 

Let us denote by V2 the set of probablity measures over R 2 endowed with the Wasserstein 
distance, and by 

(6) K = {p G V2 , supp(p) C O , p = p ou t + pn , pn(x) < 1 a.e., supp(/w) C T out } 

the set of feasible densities. Let an initial density p° be given, and r > a time step. We build 
p° = p , p\ , . . . as follows 

(7) p k T = argmin \,J(p) + I K (p) + (p,^ 1 

P 2 (K d ) l ZT 

where W2 is the Wasserstein distance, J is the dissatisfaction functional defined as 

(8) J{p) := [ D(x)p{x)dx, 

Jn 

and Ik is the indicatrix of K : 

if p G K 
+00 if p £ K. 

The function D is typically the distance to the door T ou t, and to D we associate a vector field 
U = — VD. It is important in order to have vanishing velocities on the door that D is minimal 
and constant on T ou t- 

We admit here that under reasonable assumptions this process is indeed an algorithm (i.e. p k+1 
is uniquely defined as the minimizer of the function above), and we denote by p T the piecewise 
constant interpolate of p®, p\, .... 
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As r goes to 0, by Prop. 2.2.3, Th. 2.3.1, and Th. 11.1.3 in [2], p T converges to some trajectory 
1 t— > p in K, which is a (weak) solution to 

dtp + v • ( P u) = o, 

where u is such that, for almost every t, 

u€-d(J + I K )(p), 

where d^f denotes the strong subdifferential of ty. Furthermore u minimizes the L 2 norm among 
all those fields in the subdifferential above. 

Let us now prove unformally that this characterizes the instantaneous velocity as the projection 
of U = — VD onto C p . The subdifferential of a function \t at p in the Wasserstein setting is 
defined as the set of fields u such that 

+ / (u,t(x)-x)dp(x)<*(t # p) + o(||t-i||), 
Jn 

where t denotes a transport map acting on p. Note that the previous inequality does not provide 
any information as soon as t#p is not feasible (in that case the right-hand side is +oo). Let 
us consider a feasible field v £ C p , and let us assume that, for e small enough, t £ = i + ev 
pushes forward p onto a measure in K (this is not true in general, see Remark 12. 3p . Note that 
t e is defined p-almost everywhere, with T ou t carrying some mass, but as it vanishes on T ou t (see 
Remark 12. ip . the singular part of p remains unchanged. Having e go to in the subdifferential 
inequality, we obtain 

J VD -\dp{x) + j u -vdp(x) < 0, 

so that u + VD = u — U belongs to C°, the polar cone to C p . As u minimizes the L 2 norm over 
U + Cp, u identifies with the projection of U onto C p , which ends this unformal proof. 

Remark 2.3. In general, there exist feasible densities p £ K (defined by ((6])) and fields v £ C p 
(defined by ([3])) such that (i + ev)#p exits K for any e > 0, this is why the considerations 
above do not make a rigorous proof. Consider for example u) a dense open subset in Q, with a 
small measure, and define p as l^c The pressure space is {0}, and C p is L 2 (Q): any field is 
feasible. If one considers now a strictly contractant field (with negative divergence), it is clear 
that (i + ev)#p ^ K for any e > 0. Notice also that this kind of paradox does not depend on the 
fact that we chose a "linear" perturbation (i + ev), since the same would happen if one, instead, 
perturbs the identity by following the flow of the vector field v for a time e (which is classically 
a better choice in order to satisfy the density constraint). 

As explained in the previous remark, the approach carried out in this section is not a rigorous 
proof that the advecting field is actually the projection of U onto C p . We conjecture that 
projecting (i + ev)^ p onto K (for the Wasserstein distance) introduces a perturbation which is 
negligible compared to e, so that v may actually be used as a test-function, but this conjecture 
raises some technical issues which we were not able to solve. In what follows we give an alternate 
proof which circumvents the necessity to characterize d(J + Ik)- 



2.3. Notations and statement of the main result. We first recall some results on the 
continuity equation: let (p(t, -))t>0 be a family of density measures on W d , and v : (t,x) £ 
R + x R d H- v(t, x) £ M. be a Borel velocity field such that 

(9) / / \v(t,x)\p(t,x) dx < +oo. 

Jo JR d 
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We say that (p, v) satisfies the continuity equation with initial condition p° 
(10) f^ + V.(pv) = 

\ p(o,.) = p° 

if for all 99 G C~([0,T[xR d ) we have 



(11) / / (d t ip(t,x) + V<p(t,x) ■ v(t,x)) p(t,x)dx + [ <p(0,x)p°(x)dx = 0. 

Let us recall that if p(t, .) is a solution of the continuity equation, there exists a narrowly con- 
tinuous curve pit, .) such that p(t, .) = pit, .) for a.e. t. In general, we will always focus on this 
continuous representation. 

We now detail the construction of a discrete family of densities (pt^) that approches in a sense 
we will precise later the solution of the continuity equation we are interested with. For a fixed 
time step r > 0, we define the sequence (p^) of density measures on f2 using the recursive scheme: 

( P° T = P° 

(12) ) P k r e a vgmm\jip)+I K (p) + ^W^p,p l ;~ 1 )), 



V 2 ( 

where W2 is the Wasserstein distance, and J is the dissatisfaction functional defined in ([8]). 

This construction is a minimizing movement scheme as described by DeGiorgi and Ambrosio 
in |17| Q] and then - in the framework of probability measures - in [2J [S] with functional <3?(/o) = 
J{p)+l K {p). 

i — t k 

We define on fl the discrete velocities: = -, where t^: is the unique optimal transport 

T 

function from p k to p^" 1 , which is well defined on Cl (but not necessarely on T ou t, due to the 
singular part of p k ). We also define = p^v^ on Q (by abuse of notation, we will write Cl instead 
of when we want to stress that we are not considering the boundary). We can interpolate these 
discrete values (p k , v^, E^)fc> by the piecewise constant functions defined by: 




(13) { v T (i,.)=v* if*€](fc-l)r,fcr]. 



Our goal is to prove that p T converges when r — > to a solution of the continuity equation (|10p . 
Here is our main result: 



Theorem 2.4. Let £1 be a convex bounded set ofM. d , D : R d — > M a continuous X- convex function, 
p° a probability density, and (p k ) constructed following the recursive scheme M2\) . 
Then there exists a family of probability densities (p(t, .))t>o, and a family of velocities (u(i, .))j>o 
such that (p T (t, .),E T (i, .)) narrowly converges to (p(t, .), p(t, .) u(t, .)) for a.e. t. Moreover, (p, u) 
satisfies the continuity equation: 

' d tP + v.(pu) = 

(14) \ u(i,.) = P Cp(t ,U for a.e. t 

( p(Q,.) = P° 

where U = —\7D, andC p ( t ^ is defined in f3|). 

We will first prove this theorem in the particular case where there is no exit. In the following 
section, we thus assume that T ou t = 0, which will imply that all the measures are absolutely 
continuous with respect to the Lebesgue measure. Then we will extend the proof to the general 
case. 
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Remark 2.5. We chose to assume here a A— convexity hypothesis on D both in order to clarify 
some statements, which are easier to state and prove under this assumption (see for instance 
Lemma 13.11 and the subsequent Remark I3.2p and because the typical case we think of is D = 
d(-,T ou t), where T ou t is a flat part of the boundary of the convex set £1. This implies that D is 
convex as well. It would be interesting to study the case of non-convex domains fi (for instance 
with obstacles), and use the geodesic distance for computing D, which would lead to a non- 
A— convex function, but this is not yet possible by means of our techniques, since one should 
work with the Wasserstein distance Wi computed w.r.t. the geodesic distance itself, which is not 
much studied. 

Anyway, it can be checked that the only point throughout the paper where A— convexity is 
used is the proof of Lemma [3. 11 but Remark 13.21 explains how to get rid of this assumption: this 
means that, for existence purposes, this assumption may be withdrawn. On the other hand, the 
A— convexity assumption is typical in this gradient flow problems, because it allows for uniqueness 
and stability results, and we think that similar results could be achieved in our case as well. 



3. Existence result in a domain with no exit 

3.1. Technical lemmas. Since we will make a strong use of optimality conditions in terms of 
the dual problem in Monge-Kantorovitch theory, let us briefly recall what we need. 
Given the two probabilities p and v on we always have 

^W|(m,z/) =max$ J^<pd(x+ J J>dv, C°(Tl) : <f>(x) + i>(y) < \\x-y 

the maximum being always realized by a pair of c— concave conjugate functions (</?, ip) with 
ip = ip c and ip = ip c , where the c— transform of a function x ls defined through 

I 

X c (y) = inf -\x-y\ 2 - 
x£U Z 

(with generalizations to other costs c rather than the square of the distance). We will call 
Kantorovitch potential from pto v (resp., from v to p) any c— concave function (p (resp., ip) such 
that (ip,ip c ) (resp., (ip c ,ip)) realizes such a maximum. We have uniqueness of the optimal pair 
as soon as one of the support of one of the two measures is the whole domain £1. 

Lemma 3.1. Let D : R d i— > R X-convex, and p £ K . Then, there exists r* such that for all 
r < r : 

(i) The functional <fi(p) = 3>(p) H Wf (p,p) admits a unique minimizer p m . 

(ii) There exists a Kantorovitch potential (p from p m to p, such that: 

(15) J (D + ^p> J (D + ^jp m for all p<l a.e.. 



Proof, (i) The existence of a minimizer can easily be proved using a minimizing sequence of 4>(p). 
Let pi, P2 be two different minimizers, and rj the optimal transport between p and p^. We define 

r t := (1 - t)ri + tr 2 and p t := r t #pi, for t G 10, 1[. We know that p t = . , ^= — r o (r t ) _1 . 

|detvrt| 

As M i— > (detM) -1 is convex on the set of positive definite matrices , and Vri S , we 
have : 
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We also know that p\ and p 2 are admissible, therefore: < 1 a.e., and we obtain: pt < 1. 

det Vri 

We have then 

cp(p t ) = J D((l - i)n(x) +tr 2 (x))p(x) dx + ^Wi(p t , p). 
Since D is A-convex 

D((l - t)n(x) + tr 2 (x)) < (1 - t)D{n{x)) + tD(r 2 (x)) - ^(1 - t)|n(x) - r 2 (x)| 2 . 

Moreover, V^^p) is 1-convex along the interpolation pt (see lemma 9.2.1 p. 206 in Ref [2]), 
therefore, for r small enough, we have 

<f>{pt) < (1 - t)<j>{ Pl ) + t<P{p 2 ) = inf 

K 

which is absurd. 

(ii) We first assume that p > a.e., which implies that the Kantorovich potential <p from p m to p, 
satisfying <p(xo) = (with xo any fixed point in Q), is unique. Let us define a small perturbation 
of p m : let p < 1 be a probability density, e > and p £ : — p m + e(p — /) m )- As /3 m minimizes 
0(p), we have: 

(i6) j(p e ) - J(p m ) + ^(wi(p E ,p) - wi{pm,p)) > 0. 

The first part of the left side of the inequality can easily be calculated: 

J(p £ ) - J(pm) = \ D(x)(p £ - p m )(x)dx = e / D(x)(p - p m )(x) dx . 

Jn Jn 
Let us estimate the second part: we denote by ((p e ,ip e ) some Kantorovich potentials associated 
to p and p e . We have 

\w^{p £l p) = i ip e (x)p £ {x)dx + / ip e {y)p(y) dy 
* Jn Jn 

7i W KPm,P) > / ip £ (x)Pm(x)dx + / ip £ (y)p(y) dy, 
^ z Jn Jn 

where <p e is a Kantorovitch potential from p £ to p. Thus: 

7;( w i(Pe,p) ~ Wi{p m ,p)) < [ ip £ {x)(p £ - p m )(x) dx = e [ (f £ (x) (p - p m ) (x) dx, 
z Jn Jn 

and we can deduce from inequality (|16p that: 

I D(x)(p — p m )(x) dx H — J ip £ (x)(p — p m )(x) dx > for all admissible p. 
Jn T Jn 

Let e tend to 0: (p £ converges to the unique Kantorovich potential (p from p m to p. This gives 

/ D(x)(p — p m )(x) dx H — / ip c (x)(p — p m )(x) dx > for all admissible p. 
Jn T Jn 

We now prove the general case: let pg > a.e., p$ < 1 a.e., such that p$ converges to p when 5 

tends to 0. Using (i), there exists a unique minimizer p m $ of <j>s(p) '■= / D P + Ik-\ W^ip, ps), 

Jn 2r 

and it converges to p m as 5 tends to 0. Moreover, we have proved that: 

J D(x)(p — Pm,s){x) dx H — j ip$(x)(p — p m ,s){x) dx > for all admissible p, 
Jn T Jn 

with (ps that converges to a Kantorovitch potential (p. Taking the limit 6 — > 0, we obtain the 

desired inequality. For this kind of arguments concerning optimality for transport costs and 

other functionals, see for instance Ref. 1101. □ 
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Remark 3.2. if D is not A-convex, we cannot prove uniqueness of the minimizer of (p. However, 
if p m is a minimizer, it still satisfies inequality (ii). Indeed, in the second part of the proof of (ii), 
we can define p m $ as a minimizer of <fi$(p) + csW^ip, p m )> where c$ — > (so that the optimality 
condition that we see at the limit 5 — y disregards this term) , but slowly (so that it makes Pm,5 
converge to p m ). Obviously this kind of argument was not necessary if one only wanted to prove 
this optimality condition for one minimizer Prm and not for every minimizer. 



Lemma 3.3. (Decomposition of the spontaneous velocity): 
The spontaneous velocity U = -VD can be written as follows: 

(17) U = v k + Vp k withp k eH] k . 

It 



Proof. Using the previous lemma, we know that there exists a Kantorovich potential (p from p k 
to p k_1 such that p k is a solution of the minimizing problem: 



p k G argmin <j / D(x)p(x) dx -\ — / (p(x)p(x)dx 



T 



which imposes: 



p k T = 1 on [F < 1} 
p k <l on [F = I] with F : 
p k = on [F>1], 

and ieK chosen such that ^satisfies: f „Jds = 1. 

Jn 

We can then define a pressure like function: 

(18) p k {x) := (I - F(x)) + =(l- D(x) - ^ 



Q -> R 

x i — ^ D{x) + ^. 



which satisfies: p k > 0, and p k = on [p k < 1], therefore p k G H^ k . 

Moreover, on [p k > 0], we have: Vp k = —VD — (where the density vanishes v^ may be 

r 

modified at will, so that we can keep the same formula). Since we have 



v T 



T T 



we get the desired decomposition for the spontaneous velocity : U = v£ + Vp£. □ 
Let us now define the densities p T (t) that interpolate the discrete values (p k ) along geodesies: 



(19) /*(*)= ^ \ ' (id~t*)+t*j p*. 

We also define vy(i, .) as the unique velocity field such that v T (i, .) G Tanp t T-^O^) an d (Pti v t) 
satisfy the continuity equation. As before, we define: E T = p T v T . 

After these definitions we will give some a priori bounds on the curves, the pressures and the 
velocities that we defined. In order to get these bounds, we need to start from some estimates 
which are standard in the framework of Minimizing Movements. The sequence {p k )k satisfies 
an estimate on its variation which gives a Holder and H 1 behavior. From the minimality of p k , 
compared to p k ~ l , one gets 

WKp k T ,P^ 1 )<2r^{p k )-^(p k - 1 )). 
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Since $ coincides with J, which is bounded, on the sequence (p^)fc, then we have W 2 (p^, P^~ l ) < 
Ct (discrete Holder behavior), but we also have, if we sum up over k 

(20, I>( Wr ' ( *'*'' ) )'<2»(A 

which is the discrete version of an H 1 estimate. As for p T (t), it is an absolutely continuous curve 
in the Wasserstein space and its velocity on the time interval [(k — l)r, kr] is given by the ratio 
W 2 (j%~ 1 ,tf)/T. Hence, the L 2 norm of its velocity on [0, T] is given by 

(2D /V fo(t) *_ E »2«^). 

Jo k T 

and, thanks to (|20p . it admits a uniform bound independent of r (here we use the notation |cr'|(t) 
for the metric derivative of a curve a and \a' \w 2 (t) means that this metric derivative is computed 
according to the distance W<i). This gives compactness of the curves p T , as well as an Holder 
estimate on their variations (since H l C C ' 1 / 2 ). 

Lemma 3.4. (A priori estimates): 

We have the following a priori estimates: 

(i) v r is T-uniformly bounded in L 2 ((0,T),L 2 t (Q)). 

(ii) p T is T-uniformly bounded in L 2 ((0, T), iJ 1 (J7)). 
(Hi) E r and E r are T-uniformly bounded measures. 

Proof, (i) We have the following equalities: 

rT r pkr r 

= W / P k T \<\ 2 
o Jn k J{k-i)T.Jn 

\ / r \rr. ffc/„\|2 



k 



(fc-l)r J \Jn 
2f n k-l 



2^r - 2 = -^W 2 ( Pr , Pr ). 



T T 

k 



Thanks to the general estimate (|20p we get / / p r |v T | 2 < 2<fr(p°). 

Jo Jn 

(ii) Since we have shown the following decomposition: Vp T = —X7D — v T , we have: 

/ / Pt\VPt\ 2 < 2 [ [ p T \v T \ 2 + 2 [ [ p T \VD\ 2 < C. 
Jo Jn Jo Jn Jo Jn 

But p T = on [p T < 1] , therefore / / |Vp r | 2 = / / p T \Vp T \ 2 < C. 

Jo Jn Jo Jn 

(hi) We look at E r and we use the estimates ([20]) and (|2ip . 



E T | = / / Pr|v T | < [ ( [ Pt\v t \ 2 Y ( [ PrY < f ( f Pt|v t ' 2 

o Jn Jo Jn Jo \Jn J \Jn J Jo \Jn 



< VT / pJvJ 2 < C. 



o Jn 



Therefore, E r is a T-uniformly bounded measure. The proof for E r is almost the same, estimating 
L 1 norms with L 2 norms by Cauchy- Schwartz. □ 
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Lemma 3.5. Assume that \i and v are absolutely continuous measures, whose densities are 
bounded by a same constant C. Then, for all function f G H 1 ^), we have the following inequality: 

(22) / fdfa-u) < Vc\\Vf\\ L2(Q) W 2 (»,v). 

Jn 



Proof. Let fit be the constant speed geodesic between p, and u, and w< the velocity field such that 
(/i,w) satisfies the continuity equation, and | l w t||i 2 (^tt) = ^^(a 4 )^)- F° r au IH is absolutely 
continuous, and its density is bounded by the same constant C a.e.. Therefore: 



f fd(p-u) = [ 4(/ f(xWt(x)) dt = [ [ Vf-WtdfHdt 
Jn Jo at \Jn J Jo Jn 

< (ft \Vf\ 2 dp t dt) ( f 1 [ \w t \*dfitdt) ' 
\Jo Jn J \Jo Jn J 



< VC\\Vf\\ L 2 {Q) W 2 (pL,v) 



□ 



Remark: With the same method, we can also prove: 

fd(ji-u) < Cp\\Vf\\ LP{a) W q ^,u) 

for all / G W 1,p and q such that \ + \ = 1- More generally, if p, v G L r (f2) and ||p||z/-, IM|L r < C> 
one has J fd(ji- v) < \\V f\\ LP{sl) W q {^u), provided I + I + 1 = 1 + i 

3.2. Proof of the theorem in a domain with no exit. 

Step 1: convergence of (p T ,E T ) and (p T ,E T ). We have proved that p r and E T are r-uniformly 
bounded measures, thus there exists (p, E) such that (p T ,E T ) converges narrowly to (p, E). Let 
us prove that (p r ,E r ) converges to the same limit as (p T ,E r ). 

We start from the p— part. The curves p T actually converge uniformly in [0, T] with respect 
to the W2— distance. The curves p T and p T coincide on every time of the form kr. The former 
is constant on every interval ](k — l)r, kr], whereas the latter is uniformly Holder continuous of 
exponent 1/2, which implies W2(/5 T (i), p T (t)) < Cts. This proves that p T converges uniformly 
to the same limit as p T . 

We now consider a function / G C£°([0,T] x O), and prove that / / /(E T — E T ) converges 

Jo Jn 

to as r tends to 0. We have: p T (t, .) = Tt#p k where 

T t = (t-(fc-l)r)v* + t* 

Therefore 

Pr(t + h,.) = (T t + hv k T ) #P k T = ((id + hv k T o Tt- 1 ) o T t ) # p* = (id + fcv* o T^ 1 )^, .), 
which implies that: t~ T j* + k'''' = id + hv k o Tt _1 . We can then express v T explicitely : 

v T (t, .) = lim " ld = l im h^lll = v * Tt -i, 
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and obtain 



f f(t,x)p T (t,x)v T (t,x)dx = [ f(t,T t (x))p k (x)v T (t,T t (x))dx 
Jn Jn 

f(t,T t (x))p k (x)v k (x)dx. 



9 



Hence 

rT i- _ r T k+i 



[ //(E r -E T ) < V [ \f(t,x)-f(t,T t (x))\\v*(x)\p*(x)dxdt 

JO J9 7 Jr k J9 

^ E l U Pf\x- T t(x)\\v k T (x)\p k ;(x)dxdt 

k Jr k J 9 

^ E \ LipfT\v k (x)\ 2 p k (x)dxdt < CLip/r. 



k Jl ~k 

Step 2: existence of the limit velocity. Let us prove that E is absolutely continuous with respect 
to p. Let 6 be a scalar measure, and F a vectorial measure: the function 

e:(0,F)^( [ ilT ifF«0a.e.* G [O,T] 
[ +oo otherwise 

is l.s.c. for the weak— k convergence of measures. Since we have shown the r-uniform bound: 

|E r | 



o Jn Pr 

we have 0(p, E) < +oo. Therefore E is absolutely continuous with respect p, and there exists 
u(t, .) G L 2 (p(t, .)) such that E = pu. Moreover, (p, pu) satisfies the (linear) continuity equation, 
as limit of (p T ,~E T ). 

Let us now prove that u(t) G C p (t)- Let to G (0, T), h > 0, and q G H\ to y By the continuity 
equation, we have 

t +h 

Vq(x) ■ u t (x)p(t, x) dx = I [p(t ,x) - p(t + h,x)] q(x) dx. 



t J9 J9 



Since p(to, .) = 1 wherever q > 0, and p(to+h, .) < 1 a.e., / [p(to, x) — p(to + h, x)] q(x) dx < 0, 

J9 

and we have for a.e. to 



1 



t +h 



h ./ io Jn J 9 



> — / / Vq(x).u t (x)p(t,x) dx ; — > j Vq(x) ■ u(to, x)p(to, .)(x) dx 

Vq(x).u(to,x) dx. 



9 



Using the same method between to — h and to, we also obtain the converse inequality. Finally, 
we have for a.e. to 



(23) 



J Vq(x) ■ u(to,x) dx = for all q G H^ tQ y 
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Step 3: the limit velocity satisfies: u = Pc p U. We first prove the decomposition: U = u(t, .) + 
Vp(t, .) for a.e. t. We have E r = p T v T = —p T (VD + Vp T ) = —p T VD — Vp T since p T = on 
[p T < 1]. Let us prove that p T converges to p 6 ii/r: as p T € L 2 ([0, T], .ff 1 (O)), there exists p 
such that p T weakly converges to p in L 2 ([0, T], H 1 ^)). We have obviously: p > a.e., but it is 
more difficult to show that p(t, .) = on [p(t) < 1]. We consider the average functions: 

p a / = -r^— [ b p T (t,.)dt and p a > b = J— [ b p(t,.)dt. 
b-a J a b-a J a 

Since p T = on [p T < 1], we have 

b r i rb 



p T (t,x)(l — p T (t,x)) dx dt = / / p T (t,x)(l — p T (a,x)) dx dt 

a Jci b — a J a J n 



1 



+- — - / / p T (t,x)(p T (a,x) - p T (t,x))dxdt. 



a JQ 



The first integral reads: /*•(.)(! -/vCx))* -> frtKl-*.))*,-,*'-* 

converges in if 1 (Si) - therefore strongly in L 2 (Sl) - to p a,b , and p r (0) •) weakly-* converges in 
L°°(tt) to p(a,.). Moreover, for every Lebesgue point a of p(.,x), we have: p a,b — > p{a, .), 

ft— >a 

therefore, for all these a, we have 

p a ' b {x)(l — p(a,x)) dxdt — > / p(a,x)(l — p(a,x)) dx. 
n b ^ a Jn 

Using lemma [331 we obtain for the second integral: 

/ / p T (t,x)(p T (a,x) — p T {t,x)} dx dt 

J a Jn 

< [ \\Vp T (t, .)|| L 2 (n) W 2 (p T (a, . ),pr(t, ■)) dt 
J a v v ' 

< cVb^^\\Vp T (t,.)\\ 2 L2m dty (J dtY 



< C(b-a)[ I \\Vp T (t,.)\\h {n) dt 



As / ||Vi? r (i, •)lli 2 (n) ^ * s T -uniformly bounded, ||Vp T (t, ■) 11^2(0) wea kly converges to a mea- 
sure p. Therefore, beyond a zero measure set of points a, we have 

lim / / p T (t,x)(p T (a,x) — p T (t,x)) dx dt <C\J p([a, b]) — >0. 

r^o b - a J a J n b^a 

We finally obtain: / p(a, x)(l — p(a, x)) dx = for almost every a. 
Jn 

Hence E = —pVD — Vp, with p = on \p < 1], so: E = —p(VD + Vp). Since: E = pu, we 
have shown the following decomposition: 

u = -VD - Vp i.e. U = Vp + u. 
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Moreover, by Equality ([23]), u and X7p satisfy the complementarity relation 

Vp(t, x) ■ u(t, x)dx = for a.e. t, 
which implies that we have exactly: u(t) = Pc p(i) U. 

4. Proof of the theorem in the general case 
We consider here the general case where T ou t ^ 0- 

4.1. Lack of geodesic convexity. The main problem we encounter when we want to generalize 
the previous proof is the fact that the classical geodesies no longer belong to the admissible 
space K, which is no more a geodesically convex set. Indeed, if we consider a density p° which 
is constant equal to 1 on a subset of Q, a measure p 1 which is concentrated on T ou t, and the 
geodesic p(t, .) between them, the density of p(t, .) will be of the order of 1/(1 — t) where it is 
positive, and therefore p(t, .) G" K for all t G ]0, 1[. 

This is one of the main reasons that prevent from using the standard theory of gradient flow 
for geodesically convex functionals in the Wasserstein space (see [2]). 

In this section we will investigate the connectedness properties of the set K. For the sake of 
this work, we will see that we need to estimate the length to connect two measures in K at a 
very single point of the proof. Yet, we think that these estimates are interesting in themselves 
and this is why we try to present them so that they will be valid in any dimension d. 

We define a new distance, coming from a minimal length approach, on K: 

Proposition 4.1. (Continuity of the length L) For p, v G K, we define the length 
(24) L(p,u) = inf I J \a'\w 2 {t)dt : a(t) G K, <r(0) = p, a(l) = v 

This length is finite, and it is a distance on K which is continuous for the narrow convergence: 
if (p n ), (y n ) are sequences that narrowly converge in K to p and v, then L(p n ,i/ n ) converges to 
L(p,u). 

To prove this proposition, we will first analyze the case were the domain is the unit cube 
and the door is one of the sides. We set Q =]0, l[ d_1 x] - 1,0[, Q = [0, l] d_1 x [-1,0] and 
S = [0, l] rf_1 x {0}. We will still denote by K the set of admissible measures, i.e. those who are 
composed by a density less than 1 in Q and by a possibly singular part on S. We will denote by 
y the last component of a point (x,y) G W 1 = M.^ 1 x R. When integrating over S", we write dx 
instead of T~l d ~ 1 (dx) or similar expressions. 

Let us start from a simpler case. 

A first useful lemma is the following: 

Lemma 4.2. Let p° , p 1 be two probability measures on Q of the form p l = Pq + p^, where pg 
has a density on Q bounded by k and p^ is concentrated on S. Set t = Wi(p°,p 1 ). Then, for 
any Lipschitz continuous function j we have 

[ jd(p s -p 1 s )<Lip(j)£ + c(k)\\j\\ L o O £ 1 / 2 , 
Js 

I j(p° Q - pIj) < 2Lip(j)£ + c(k)\\\j\\L~l 1/2 . 
JQ 

Proof. We start from the first estimate: consider a function \S '■ Q ~ * [0, 1] such that xs = 1 on 
S, XS = outside a strip of width 5 from S, and | Vy^l < (as a matter of fact, it defines this 
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function as x&{ x i v ) = (1 + $~ l y) + )- We may write 

/ jd(p° s -p l s )= [_j Xs d(p° - p 1 ) - [ jX8d(pl-p l Q )<Lip(jx5)i + k8\\j\\L^- 
Js Jq Jq 

Then we use Lip(jxs) < Lip(j) + ||j||L°°<5 _1 and we get 

J jdijP 8 - p\) < (lAp(j) + l -^py + k5\\j\\ L ^, 
which implies, by choosing 6 = i 1 / 2 , 

[ jd(p° s - p\) < Li P (j)i + c(k)\\j\\ L ^e^ 2 . 

Js 

As far as the second estimate is concerned, just write 

/ j(p°q-Pq)= LKp'-P 1 )- [ jd(Ps-Ps) 
Jq Jq Js 

and use _j(p° ~ p 1 ) < Lip(j)£ and the previous inequality. □ 
■JQ 

It is important to notice in the above inequality that, once we fix p l Q or p l s , the two estimates 
separately make I appear, where i may be the W\ distance between any pair of measures, 
satisfying the constraints, having p l g or p l s as an internal or boundary part, respectively. The 
pair of measures we use need not to be the same in the two estimates. 

Lemma 4.3. Let p°, p 1 £ K be two admissible probability measures on Q and L,M > 1. Suppose 
that p° and p 1 are of the following form: 

p i = pQ + PS, P Q ^£ d , P S = h l -n d ~\ hi<M, Lip{hi)<L, i = 0,l. 

Then, there exists a curve p t from p° to p 1 , contained in K (the set of admissible measures) and 

such that its W2— length does not exceed C(d)M 1 ^ 2 y / L£ + Ml 1 / 2 , where we set I := Wi(p°,p 1 ). 

Moreover, the same stays true if £ stands for a number such that there exist "extensions" of Pq 
on S and of p l s on Q that belong to K and such that for both extensions the new W\— distance is 
smaller than I (but the two extensions may be different). If instead of staying in K the constraint 
on the density in Q is relaxed to "being smaller than k" with k > 1, the constant will also depend 
on k, as in Lemma \4-ty 

Proof. It is possible to replace the two probabilities on Q with probabilities p % on R = [0, x 
[— 1,M] so that p l is absolutely continuous with density less than 1 and (ttq)#p" 1 = p % (where ttq 
is the projection on Q). We will take 

p l = ph + I-jkm^) • £ d - 

Consider the geodesic p l from p° to pr. It is a curve of measure whose length is exactly W2(p°, p 1 )- 
Moreover, if one projects on Q all the trajectories of the particles of this curve, one gets the curve 
(vrQ)#p', which connects p° to p 1 but stays in K (since the only effect of the projection is to 
send all the mass on R \ Q on S, while the densities inside Q are not affected. And we know 
that the densities of p l will not be larger than 1, since this is the case for a geodesic between two 
measures with densities bounded by 1. 

Hence we only need to estimate W% (p , p 1 ) . For simplicity, let us estimate W\ instead of W2 ■ 
We will conclude by the inequality W2 < (diami?) 1 / 2 !^^ 2 . Notice that the diameter of R is 
y/(M + l) 2 + d-l < C(d)M. 
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To estimate W\, take a function / 6 Lip\{R). What follows will be easier to justify in case / 
is regular but everything will work (by density, or instance), for any / whose Lipschitz constant 
does not exceed 1. Let us define, for x £ [0, and a, b G [0, M], 

£f(x,t)dt 

g(x,a,b) = . 

b — a 

We denote by g x , g a and gj, the partial derivatives of g. We can verify that 

IaUx,t)dt 

\g x (x,a,b)\ = ,7 : < Lip(f) = 1, 



\b-a 

then we compute gt, and we get 



\9b(x,a,b)\ 



f(x,b) g{x,a,b) 



b — a b — a 



— 71 r " ~~ a l = i> 



and, analogously, \g a \ < 1. 

In particular, if one takes two Lipschitz functions a{x) and b(x), one has Lip(g(x, a(x),b(x))) < 
1 + Lip(a) + Lip(b). 

Now we write 

/ d(p° -p 1 )= f d(p° Q -p 1 Q )+ / f(x, t)dt - / f(x, t)dt dx. 
R JQ JS \Jo JO J 

We estimate both terms thanks to the previous lemma. The first term in the right hand side is 
less than £, while for the second we may write 

h (x) rhi(x) \ r 

f(x,t)dt — j f(x,t)dt\dx = J g(x,ho(x),hi(x))(ho — hi)(x)dx. 

Hence we are in the case of the previous lemma with j{x) = g(x,ho(x),hi(x)), and hence 
Lip(j) < 1 + 2L and HjHl 00 < M + y/d (the first estimate comes from our study of g, for the 
second just suppose that / vanishes somewhere on S). 
Hence we get, using the arbitrariness of the function / 

Wi(p°,p l ) <£ + (l + 2L)£ + 2M£ 1 / 2 . 

To simplify the computations we use 1 < M, L and get 




W 2 (p°,p 1 ) < C{d)M l / 2 W l {p°,p l ) l/2 < C(d)M 1 / 2 VL£ + M£ 1 / 2 . 

The last part of the statement is an easy consequence of the technique we used and of Lemma 
IP1 □ 

Theorem 4.4. Let p,Q and \x\ be two probabilities in K . Then there exists a curve (p>t)t connecting 
Hq to ni> such that its W 2 — length does not exceed C(d)W\ (po, //i) 1 ^ 4 ^ and that pit G K for every 
t. 

Proof. Take e > and modify p l into a new measure p l 6 K by regularizing in the direction 
of x: it is sufficient to take the convolution of p l s with a kernel of the form C{e l ~ d — e -d |x|)+. 
This ensures that the W± distance has not increased and that the new measures on S will have 
Lipschitz and bounded densities on S, with M < Ce 1 ~ d and L < Ce~ d , and on Q the constraint 
is kept as well. Yet, there is a problem: these measures may exit the domain. There are two 
possible ways for solving this problem, and both will be useful. 

One possibility is rescaling of a factor (1 + 2e) _1 , so that all the mass is pushed again into the 
domain. This does not change significantly the values of M and L but the densities inside will 
be no more bounded by 1. They will be bounded by a constant k close to 1. In this case too the 
Wasserstein distance has not increased, since the rescaling was a contraction. 
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The other possibility is composing with a contracting transport T : Q £ — > Q (Q £ being the 
e— neighborhood of Q), which is chosen so that the convolution of the constant function 1 is sent 
onto the constant function 1 (this is possible thanks to the fact the convolution keeps the mass 
unchanged) . This construction ensures that the constraint inside Q will be satisfied but unluckily, 
since the inverse of T is not Lipschitz continuous (due to the fact that the densities vanished on 
the boundary of Q e ), it is not suitable for S. Anyway, in this case too, the Wasserstein distance 
was not increased. 

Hence we do a mixed procedure: we use the second possibility in Q and the first on S. It is 
clear that in this way we have good densities both in Q and on S, and we can apply Lemma B~2 
and the last statement of Lemma 14.31 Notice that the W\— distance between the two measures 
p l G K that we constructed could be larger than I. It is easily estimated by something like £ + e 
but this is not sufficient for the following estimates. 

Now, to connect pP to p, , one can first connect each p % to p l , and the cost is no more than e, 
since it is sufficient to spread every particle on a ball of radius e (i.e. the radius of the support 
of the previous kernel) when we do convolution, and then to move it no more than e when we 
compose with a contraction. After that, one uses the previous Lemma to estimate the length for 
connecting p° to p 1 and gets 



mm 



length <2e + C(d)e il ~ d)/2 Ve~ d £ + e l ~ d £ l l 2 = 2e + C{d)e^- d \l ^ + —, 



e 2 



where t denotes the W\ distance between p° and fjt . If one supposes that e is chosen so that 
£e~ 2 is smaller than 1, one can estimate the last sum in the square root and get 

minlength < 2e + C{d)e 1 - d £ l/i . 

Choosing e = t 1 '^ gives at the same time that £e 2 is small and that the minimal length may 
be estimated by i 1 /^. □ 



To approach the general case one needs to use the following theorem, which has already 
been used in a transport-related setting with density constraints in the variational theory of 
incompressible Euler equations by Y. Brenier and provides a useful tool for reducing to the cube 
(see Refs |16| and |40| for the applications to fluid mechanics). 

Theorem 4.5. For any sufficiently good domain Q C M d which is homeomorphic to the cube, 
there exists a bi-lipschitz homeomorphism <\> : Q — > Q such that <j)^{C^) = cC^q. Moreover, the 
behavior of <f> on the boundary may be prescribed at will. 

4.2. Generalization of the technical lemmas. In this section, we briefly explain how to 
generalize the technical lemmas we used in the first proof (with T ou t = 0)- 
Conditions on the minimizer for the discrete problem. 

• First of all, in lemma 13.11 we can't prove the uniqueness of the minimizer p m with the 
same method. Indeed, exactly as we explained in the previous section for geodesies, 
the interpolation pt between two possible minimizers does not necessarily belong to K. 
Therefore, we will have to apply the "selection" method explained in Remark 13. 21 in order 
to prove inequality (|15p . More precisely, the case where p > remains unchanged, but 
in the general case, we fix a minimizer p m of (j), and we define p m ^ as a minimizer of 

MP) ■= / Dp + I K (p) + ^-Wi(p,p s ) + c s Wi(p,p m ), 
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with cs that converges to slower than W2(p,p~s)- Since p m and p m ^ are minimizers of 
J and (ps, we have the following inequalities: 

/ Dpm,8 + 7T~Wf {p m ,8, P&) + CgW^ip, p m ) < [ Dp m + -^W| G°m, Ps) 

Jn 2t Jn 2t 

f 1 1 

D Pm + — W^( Pm , p) < f Q Dp m , 5 + —Wiipmj, p) 

which implies, using the triangular inequality: 

W$ {p m ,8 ,Pm) < 7T— [Wi (pm,S , p) ~ W$ (p m ,5 , Ps) + W% {p m , Ps) ~ W$(p m , p)] 

6 s 

< ^W 2 (p,p s ) — > 0. 

Therefore, p m § converges to the fixed minimizer p m . We can then pass to the limit 
(5—7-0, and obtain the same inequality for Pm- 

It is also useful to notice another feature of the problem with an exit T ou t: once some 
mass arrives to the exit, it does not move anymore. This precisely means the following: 
if 7^ is an optimal transport plan from the selected measure p k to the previous one, p^" 1 , 
and (x,y) G supp(7^) with y G T ou t, then y = x. This means that all the mass which 
was already on the door for p k ~ l will not move. To prove it, it is sufficient to consider 
the map F:!]x!l- > Q x Q defined by F(x, y) = (y, y) if y G T out and F(x, y) = (x, y) 
if y ^ ^out- The measure F^ k is a transport plan between a new measure p and p^ 1 , 
which reduces the transport cost and the functional J (since D is minimal on the exit). 
Moreover, since p is obtained from p k by moving some mass onto the door, we have 
p G K as well. This would contradict the optimality of p k unless F#~/ k = 7*, which is 
the thesis. 

This also proves uniqueness of the optimal transport plan between p k and p k ~ l since, 
if we look it the other way around (from p^T 1 to p k ), we can decompose the problem 
in one part which will not move (corresponding to p£ _1 lr ou t) an d one part which is the 
transport of an absolutely continuous density ). 

We will also denote by E k the excess mass of p k with respect to p^T 1 on the exit, i.e. 
:= p^Tout) - p k T - l (T out ) >U. 

In lemma [3T3l the solution of 



p k T G argmin < / D{x)p(x) dx H — / (p(x)p(x) dx 
P&K [Jn T Jn 



is not necessarily the same in the general case, as there exists no limit density on r ou ^. 
Let us define: I : = inf F(x), and T m i n = {x G T ou t : F(x) = I}. If \[F < l]\ > 1, then 

the solution is the same as in the previous proof. However, if \[F < l]\ < 1, it costs less 
to put a part of the density onto T out . The solution is therefore given by: 

p k T = 1 on [F < l], 

p k >0 on T min , withp k (T min ) = l-\[F <l}\, 

p k <l on [F = l]\T min , 

p k = on [F>1]. 

The pressure p k defined by 

p k (x) := (l-F(x)) + =(l-D(x)-^ 
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then belongs to H\ , and we prove the decomposition U = v£ + Vp k as before. 

In order to prove the a priori estimates of lemma 13,41 we have to take into account the 
singularity part of the densities on T OU f Notice that, to avoid any ambiguity where the transport 
does not exist, we only defined a discrete velocity vector field inside Cl. To be clearer, we want 
to spend some disambiguation words on what E T and E T are in this case. 

• The measure E r is as usual defined as the vector measure satisfying the continuity equa- 
tion with the curve p T . We also have an explicit formula in terms of the optimal transport 
plans 7^ from p\ to p£ -1 : for any t 6 [k — It, kr[ take 

E T (t) := (T(fcr-t)/r) # (^-7^J, 

where tt s (x, y) = (1 — s)x + sy. 

• The measure E T is simply defined as the product of P t 1q times the velocity vector field 
defined in Section 2.3, on the non-singular part only (again we use Cl instead of O to 
stress that the boundary is excluded). As before, the idea is that this vector measure 
satisfies good properties from optimality conditions, while the previous one satisfies the 
continuity equation. We need to compare them. 

• There is also in this case a third vector measure, that we can call E r , which is defined 
exactly as E r but ignoring the part on T ou t: 

E T (t) := (7T (fcT _ t)/T ) # • 1^7*) ■ 

The utility of E T is that it is more easily comparable to E r . 
We come back to the proof of lemma [3T4l as a matter of fact, we now have: 

Wi(p k ;-\p k ;) = r 2 [ p*|v*| 2 + / Ix-yfd^y), 
where j k is the optimal transport plan between p\\ and p\~ x . Therefore, we have 

/ PrK\ 2 <r-^{p k -\p% 
Jn 

and the a priori estimates (i) and (ii) are still satisfied. The proof of (iii) is unchanged, but let 
us remark that we have no longer the equality: 

/ P k r\^\ 2 = ! P^| 2 , 

Jn Jci 
and that the geodesic p T does not belong to K. 

Lemma 13.51 is no longer true for densities that are not absolutely continuous with respect to 
the Lebesgue measure. Indeed, as we have seen before, the geodesic between two densities of K 
does not belong to K. We prove instead the following lemma: 

Lemma 4.6. Let p,u E K. Then, for all function f G H 1 with f = on T ou t, we have the 
following inequality: 

[ fd(p-u) < \\Vf\\ L 2 (Q) L^,u) 
where L(p, u) is the length of the shortest path in K joining p and v (see \24\) )■ 
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Proof. The proof is an adaptation of the one of Lemma [3.51 : let at be a minimal length curve in 
K joining p and u, and let Wt such that (a, w) satisfies the continuity equation and Hw^^^) = 
L(p, v). Since / € H 1 with / = on T ou t, then V/ does not see the part of ctw on the boundary), 
so that we have: 

fd(fi-u) = rUffdat) = I [vf-w t da t dt 



n Jo dt \j n / j j n 

< ( t I. \Vf\ 2 da t dt) ( f 1 f \w t \ 2 da t dt) ' 
\Jo Jn J \Jo Jn J 

< \\Vf\\ L 2 {n) L(p,v) 

since at < 1 in 0. □ 

4.3. Generalization of the proof. At step 1, we need again to prove that the limits of E T 
and E T are the same. As far as the limits of p T et p T are concerned, everything works as in 
Section 13.21 this also proves that the limit curve p belongs to K, since this is the case for p T 
(but not for p T ). 

It is easy to check that the comparison we did in Step 1 of Section [3T21 may be performed again 
so as to obtain that the limit of E T and E T are the same. What we need to do now is proving 
that the limit of E T and E T are the same. We will prove that the mass of E T — E T is negligible, 
i.e. that 

dt d\EM) -E T (t)| — > 0. 
Jo Jn 

To do this, it is sufficient to estimate 

E /" dt j ^lr ollt x^7. fc = E/ \ X -y\l ToutxTi d^ 

Thanks to what we underlined before, namely that the mass which is on T ou t does not move 
any more, we know that \x — u|l r ^nd'yii = \x — v\\ r a cfo^ and the mass of l r a d^ is 
exactly the excess mass E*. Thanks to the Lemma 14.71 below, we can go on and obtain 



T/t T/t 

E L J x -y\ 1 T ^Tidi k T = Y. L Jx-V^r^ttH 



W""' fc=0' 

< E ( I > - y\ 2d ^) 5 ( f . dj*) 3 < y: wm~\ 



k=0 



r out x ^ 



k=l 




'E(T/r) 

<( E wM-\p k T?) IE 1 ) * 7)" = cr ^^°- 



2 /S 

Lemma 4.7. Suppose p, v £ K and set E := |/i(r otl t)— i/(r ou t)| . Then we have E < CW 2 (Mj^Jj 
where the constant C depends on the geometry of O and T ou t . 

Proof. Suppose for simplicity v(T ou t) > p(T ou t). Take an optimal transport plan 7 from p to v. 
Consider 7' = 1-q xT 7. The mass of 7' is a number E' , larger than E. Let p! be the projection 
of 7' on the first variable (x): it is a measure with mass E' , dominated by 1a,// (and hence it is 
absolutely continuous with density smaller than 1). We have 

W 2 {p,v) = j \x-y\ 2 d-i> j \x-y\ 2 d^'> j d(x,T out ) 2 drj = j d(x ,T out ) 2 'dp! '. 
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It is sufficient to prove that this last integral is larger than c(E') 3 . Set d(x) := d(x,T ou t): we 
will use the fact that | [d < t] \ < ct. We have 

/roo roo 
d(x) 2 dfi' = / p'([d 2 > t])dt = (E 1 - p'([d < Vt]))dt 
Jo Jo 

foe r{E'/cf 

> (E' -\[d< Vt}\) dt > (E' - cVt) dt = c{E'f. 

Jo Jo 



□ 

At step 2, we prove with the same method that E is absolutely continuous with respect to the 
density p := lim(/9 r fo = p^i (the decomposition of the measures into a part on Y ou t and a part 

T— >0 

on O passes to the limit, because of the density bound on Cl): there exists u such that E = pnu. 
Moreover (p, E) satisfies the continuity equation, and we can prove again the equality 

/ Vq-udx = VgG H 1 
Jn 

At step 3, the first estimates are still true, since we integrate over (l (p T = on T ou t)- However, 
we can't use lemma [3~5l anymore. Instead, we apply lemma H~6l and get the inequality 

rb r rb 

l p T (t,x)(p T (a,x) - p T (t,x)) dxdt < / 1 1 Vp T (t, .) 1 1 L 2 (n) L (Pr (a, ■), Pr (*,•)) dt - 

'o J Cl J a 

Using proposition (|4.1|) and the same notation as in Section T3.21 step 3, the limit r — > reads: 



1 



T — ¥ 



lim / / p T (t,x)(p T (a,x) — p T (t,x)) dx dt 

b — a 1 



a J CI 



1 



b 



^^[j L(p(a),p(t)) 2 dt 
Since at the limit, the curve p(t) belongs to K for every t, we have also: 

L(p(a),p(t))dt< j\ P '\ W2 (s)ds < ^j\p'\ 2 W2 {s)d S ^j ' {t-a) 1 ' 2 <C{b-a) 1 / 2 . 
Therefore, we have the following inequality: 

lim- / / p T (t,x)(p T (a,x) — p T (t,x)) dx dt < J p([a, 61) ( / C(b — a)dt] 

r^ob-a J a J n b -a \J a ) 

= C\/p([a, b]) — > for a.e. a, 

b— >a 

and we conclude the proof as in the particular case T ou t = 0. 

5. Illustration: a convergent corridor 

We present here an example where both the transport equation and discrete process of the 
gradient-flow problem can be solved quasi-explicitely. We also give numerical estimations on the 
convergence of the discrete scheme to the solution of the continuity equation. 

We want to model the displacement of a crowd throught a convergent corridor. We thus take 
for Q a portion of a cone, expressed in polar coordinates as [r £ [a, R], 6 G [—a, a]] (see fig 
[2]), with a possible "exit" T ou t = {a} x [—a, a], and we take for D the distance to the exit (or 
to the apex, which is equivalent): D(r) = r. We assume that the initial density is uniform: 
p°(r) = p° < 1. We will consider in this section two examples: the case a = with no exit (so 
that people will in the end concentrate on the neighborhood of the vertex) and the case a > 
with exit. 
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(25) p k T (r) 



FIGURE 2. Modeling of the displacement of a crowd throught a convergent corridor. 

Thanks to the symmetry of this problem, the minimizing movement scheme can be written as 
a minimization problem on the transport function: 

p? = p° 

p k T = swtp^ -1 , s k G argmin j / ( D{t{r)) + —\r - t(r)| 2 J p* _1 (r) r dr 1 . 

Let us first consider the case where a = (and r on j = 0), where this problem can be explicitely 
solved: p k is given by 

1 on [a,b k [ 

p° (l + y\ on [b k T ,R-k T [ 
on [i?-fcr,i?], 

where b k satisfies the recurrence relation 
60 = 

{b k T ) 2 - p°(b k T + kT) 2 = (6^- 1 ) 2 -p°(6^- 1 + (A;-l)r) 2 , 
and the solution of the continuity equation can be easily calculated: 

( 1 . A ifrG ^ r 6(o) = o 

p(i, r) = <^ p° ( 1 + - if r € [6(t), iJ - t] where J + t 

(o V ^ iire [R — t,R], l l) " ' (*(*)+*)" 

In figure [3J we represent the discrete densities p k at different times for the numerical values 
r = 0.01, a = 0, R = 10, and p° = 0.4. Let us remark that the recurrence relation that satisfies 
b k is a numerical scheme for the ODE on b(t). Indeed, it writes 

b k -b^_^b k + b^ 2*-i\ where ^ 



(26) 



6(*)-p° (&(<) + *)' 
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Using the conservation of the total amount of people, it is easy to prove that this scheme is exact 
at every time step kr, and so is the discrete solution p k . 



FIGURE 3. Evolution of the solution of the minimizing movement scheme in the 
case where T out = 0. 

We now consider the case a > with exit. The densities have then the same form, except 
that the evolution of the interface in the continuous case is now given by 



if b^T 1 > R — (k — l)r, where r e is the (unknown) radius such that people who were between a 
and r e at step k — 1 will exit the corridor (i.e. arrive at a) at step k. This radius is given as the 
minimum of an integral expression that we will not develop here. In figure 01 we represent the 
discrete densities for a = 1 and for the same numerical values as before. 

It is also interesting to estimate numerically the error between the solution of the continuity 
equation and the solution of the minimizing movement scheme. In this purpose, we consider 
the case where the density is initially saturated (p° = 1), and we compute b and b T with high 
accuracy (high order method for the ODE on b, and precise quadrature and optimization methods 
to estimate r e and b T ), so that space discretization does not affect error estimation. We obtain 
numerically that b T converges to b when r tends to with an error of order 1 (interpolation 
polynom of r i— > \b(T) — b T (T)\ gives order 0.989 for T = 1), which gives also an order 1 error for 
the Wasserstein distance between p and p T . 



We would like to conclude this paper by some remarks on the limitations of the overall approach 
in terms of modelling, and on possible extensions to other domains. 

With its very macroscopic and eulerian nature, the model is designed to handle large popula- 
tions as a whole, and does not allow to localize and follow in their path individual pedestrians. 
As a direct consequence, the spontaneous (or desired) velocity of an individual may depend on its 
location only, so that differentiated individual strategies (e.g. avoidance of crowded zone, skirting 




r ln(r/a) 

whereas in the discrete case, b k satisfies now the recurrence relation 

{b k T f -a 2 - p (b k T + kr) 2 = (b^ 1 ) 2 - r 2 e - p (6^ 1 + (k 
if b*T l <R-{k- l)r, and 






6. Modelling issues, extensions 
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FIGURE 4. Evolution of the solution of the minimizing movement scheme in the 
case with exit. 



of obstacles) cannot be included straightforwardly. Besides, the macroscopic expression of the 
non-overlapping constraint is less restrictive than its microcopic counterpart. In the microscopic 
setting (people are identified with rigid discs), for highly packed situations, the non overlapping 
constraints induce some kind of non- negative divergence constraint in the directions of contacts. 
Consider the example of a cartesian distribution of monodisperse discs (see Fig. [5l left), with a 
uniform spontaneous velocity directed toward a wall. The actual velocity will be 0, whereas in 
the macroscopic version it is not (see Fig. [5j center). Note that, if the microscopic distribution 
is modified, while mean density is preserved, the situation is no longer static (see Fig. right). 
This example illustrates the deep difference between the microscopic approach (for which local 
structure and orientation of contacts lines play an essential role), and the macroscopic one, which 
only considers local density. 




FIGURE 5. Differences between micro and macro approaches 

As a consequence the model is unlikely to reproduce the formation of blocked archs near an 
exit, which are observed in practice in highly critical emergency situations, and which can be 
recovered by the microscopic model, even without friction. Note also that, together with the 
structural anisotropy we just mentionned, individual anisotropy of pedestrians is not handled 
(whereas it may be in microscopic models by replacing discs by ellipses, for example). 

Yet, despite these limitations in the modelling of crowd motion, we believe that this new 
type of evolution problem may be fruitful to model phenomena, in particular in the domain 
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of cell dynamics. In this context, the spontaneous velocity would be replaced by some kind of 
chemotaxis velocity. As an illustration, let us express what could be called a unilateral version 
of Keller-Segel equation, in the spirit of what has been presented here: 

|+V.(pu) = 

u = P Cp U, U = Vc, -Ac = p, 

where c denotes the concentration of some attracting agent, generated by the cells themselves. 
Notice that the congestion constraint prevents concentration of mass. As a matter of fact, the 
characteristic function of a single ball is a static solution to this system, in the whole space 
]R 3 , and it can be expected that any solution converges to such a configuration. Note also that 
bacterial growth could be handled by adding an appropriate term in the right-hand side of the 
transport equation. 

We also believe that it can be fruitful in the modelling of granular media. Bouchut et al.[H] 
propose a model of pressureless gas for which the density is subject to remain less than 1. This 
model is essentially mono-dimensionnal (the construction of explicit solutions proposed in [B] 
uses extensively the one-dimensionality). As our model can be seen as a first order (in time) 
version of this second order pressureless gas model, we believe that the handling of the congestion 
constraint we propose here, which applies in any dimension, might be used in the future for a 
macroscopic description of granular flows in higher dimension. The corresponding model, which 
is a second order version of the model we considered here, could be written as follows 

| + V.(pu) = 0, 



d(pu) 



+ V • (pu <g> u) + Vp = 0, 



dt 

0<p< 1, p(l-/>) = 0, u+ = P Cp u-, 
where C p is the cone of feasible velocities which we introduced, and u~ (resp. u + ) is the velocity 
before (resp. after) the collision (both are equal in case there is no collision). 

Those extensions are not straightforward. In particular, the "spontaneous" velocity (i.e. 
chemotactic velocity for cell models, velocity before collision for granular flow models) may 
not be a gradient, and may furthermore vary in time. It rules out some arguments we used here. 
The discrete gradient flow construction, which is based on a simultaneous handling of advection 
and congestion constraint, could be replaced by a prediction-correction strategy: a first step of 
free advection followed by a projection (for the Wasserstein distance) onto the set of feasible den- 
sities. This approach would reproduce in the Wasserstein setting the techniques which proved 
successful in the context of sweeping processes in Hilbert spaces, which we mentioned in the 
introduction (see Refs. [36], [20], [21]). 
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